AITopics | virtual camera

Collaborating Authors

virtual camera

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

If you are looking for an answer to the question What is Artificial Intelligence? and you only have a minute, then here's the definition the Association for the Advancement of Artificial Intelligence offers on its home page: "the scientific understanding of the mechanisms underlying thought and intelligent behavior and their embodiment in machines."

However, if you are fortunate enough to have more than a minute, then please get ready to embark upon an exciting journey exploring AI (but beware, it could last a lifetime) …

Co-Located VR with Hybrid SLAM-based HMD Tracking and Motion Capture Synchronization

de Sousa, Carlos A. Pinheiro, Gröne, Niklas, Günther, Mathias, Deussen, Oliver

arXiv.org Artificial IntelligenceSep-9-2025

We introduce a multi-user VR co-location framework that synchronizes users within a shared virtual environment aligned to physical space. Our approach combines a motion capture system with SLAM-based inside-out tracking to deliver smooth, high-framerate, low-latency performance. Previous methods either rely on continuous external tracking, which introduces latency and jitter, or on one-time calibration, which cannot correct drift over time. In contrast, our approach combines the responsiveness of local HMD SLAM tracking with the flexibility to realign to an external source when needed. It also supports real-time pose sharing across devices, ensuring consistent spatial alignment and engagement between users. Our evaluation demonstrates that our framework achieves the spatial accuracy required for natural multi-user interaction while offering improved comfort, scalability, and robustness over existing co-located VR solutions.

alignment, artificial intelligence, human computer interaction, (17 more...)

arXiv.org Artificial Intelligence

2509.06582

Country: Europe (0.46)

Genre: Research Report (0.83)

Industry:

Leisure & Entertainment > Games (0.46)
Information Technology (0.46)

Technology:

Information Technology > Human Computer Interaction > Interfaces > Virtual Reality (1.00)
Information Technology > Communications (1.00)
Information Technology > Artificial Intelligence > Robots (1.00)
(2 more...)

Add feedback

Environment-Driven Online LiDAR-Camera Extrinsic Calibration

Huang, Zhiwei, Li, Jiaqi, Zhong, Ping, Fan, Rui

arXiv.org Artificial IntelligenceFeb-2-2025

LiDAR-camera extrinsic calibration (LCEC) is the core for data fusion in computer vision. Existing methods typically rely on customized calibration targets or fixed scene types, lacking the flexibility to handle variations in sensor data and environmental contexts. This paper introduces EdO-LCEC, the first environment-driven, online calibration approach that achieves human-like adaptability. Inspired by the human perceptual system, EdO-LCEC incorporates a generalizable scene discriminator to actively interpret environmental conditions, creating multiple virtual cameras that capture detailed spatial and textural information. To overcome cross-modal feature matching challenges between LiDAR and camera, we propose dual-path correspondence matching (DPCM), which leverages both structural and textural consistency to achieve reliable 3D-2D correspondences. Our approach formulates the calibration process as a spatial-temporal joint optimization problem, utilizing global constraints from multiple views and scenes to improve accuracy, particularly in sparse or partially overlapping sensor views. Extensive experiments on real-world datasets demonstrate that EdO-LCEC achieves state-of-the-art performance, providing reliable and precise calibration across diverse, challenging environments.

artificial intelligence, correspondence, machine learning, (19 more...)

arXiv.org Artificial Intelligence

2502.00801

Country:

Asia > China > Shanghai > Shanghai (0.04)
Asia > China > Shaanxi Province > Xi'an (0.04)
Asia > China > Hunan Province (0.04)

Genre: Research Report (1.00)

Technology:

Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.69)

Add feedback

Free-Moving Object Reconstruction and Pose Estimation with Virtual Camera

Shi, Haixin, Hu, Yinlin, Koguciuk, Daniel, Lin, Juan-Ting, Salzmann, Mathieu, Ferstl, David

arXiv.org Artificial IntelligenceMay-10-2024

We propose an approach for reconstructing free-moving object from a monocular RGB video. Most existing methods either assume scene prior, hand pose prior, object category pose prior, or rely on local optimization with multiple sequence segments. We propose a method that allows free interaction with the object in front of a moving camera without relying on any prior, and optimizes the sequence globally without any segments. We progressively optimize the object shape and pose simultaneously based on an implicit neural representation. A key aspect of our method is a virtual camera system that reduces the search space of the optimization significantly. We evaluate our method on the standard HO3D dataset and a collection of egocentric RGB sequences captured with a head-mounted device. We demonstrate that our approach outperforms most methods significantly, and is on par with recent techniques that assume prior information.

optimization, sequence, virtual camera, (15 more...)

arXiv.org Artificial Intelligence

2405.05858

Country: Asia (0.04)

Genre: Research Report (1.00)

Technology:

Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Artificial Intelligence > Vision > Video Understanding (0.51)
Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (0.34)

Add feedback

BundledSLAM: An Accurate Visual SLAM System Using Multiple Cameras

Song, Han, Liu, Cong, Dai, Huafeng

arXiv.org Artificial IntelligenceApr-1-2024

Multi-camera SLAM systems offer a plethora of advantages, primarily stemming from their capacity to amalgamate information from a broader field of view, thereby resulting in heightened robustness and improved localization accuracy. In this research, we present a significant extension and refinement of the state-of-the-art stereo SLAM system, known as ORB-SLAM2, with the objective of attaining even higher precision. To accomplish this objective, we commence by mapping measurements from all cameras onto a virtual camera termed BundledFrame. This virtual camera is meticulously engineered to seamlessly adapt to multi-camera configurations, facilitating the effective fusion of data captured from multiple cameras. Additionally, we harness extrinsic parameters in the bundle adjustment (BA) process to achieve precise trajectory estimation.Furthermore, we conduct an extensive analysis of the role of bundle adjustment (BA) in the context of multi-camera scenarios, delving into its impact on tracking, local mapping, and global optimization. Our experimental evaluation entails comprehensive comparisons between ground truth data and the state-of-the-art SLAM system. To rigorously assess the system's performance, we utilize the EuRoC datasets. The consistent results of our evaluations demonstrate the superior accuracy of our system in comparison to existing approaches.

multiple camera, sequence, slam system, (15 more...)

arXiv.org Artificial Intelligence

2403.19886

Country:

North America > United States > California > Los Angeles County > Los Angeles (0.14)
North America > United States > New York > Tompkins County > Ithaca (0.04)
Asia > China > Guangdong Province > Shenzhen (0.04)

Genre: Research Report (0.40)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (0.48)
Information Technology > Artificial Intelligence > Robots > Autonomous Vehicles (0.46)

Add feedback

OpenAI's Sora Turns AI Prompts Into Photorealistic Videos

WIREDFeb-15-2024, 18:15:00 GMT

We already know that OpenAI's chatbots can pass the bar exam without going to law school. Now, just in time for the Oscars, a new OpenAI app called Sora hopes to master cinema without going to film school. For now a research product, Sora is going out to a few select creators and a number of security experts who will red-team it for safety vulnerabilities. OpenAI plans to make it available to all wannabe auteurs at some unspecified date, but it decided to preview it in advance. Other companies, from giants like Google to startups like Runway, have already revealed text-to-video AI projects.

openai, sora, video, (11 more...)

WIRED

Country: Asia > Japan > Honshū > Kantō > Tokyo Metropolis Prefecture > Tokyo (0.07)

Industry:

Education > Educational Setting > Higher Education (0.55)
Education > Curriculum > Subject-Specific Education (0.55)
Media > Film (0.37)
Information Technology > Security & Privacy (0.35)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Chatbot (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning > Generative AI (1.00)

Add feedback

Multi-log grasping using reinforcement learning and virtual visual servoing

Wallin, Erik, Wiberg, Viktor, Servin, Martin

arXiv.org Artificial IntelligenceJan-24-2024

We explore multi-log grasping using reinforcement learning and virtual visual servoing for automated forwarding in a simulated environment. Automation of forest processes is a major challenge, and many techniques regarding robot control pose different challenges due to the unstructured and harsh outdoor environment. Grasping multiple logs involves various problems of dynamics and path planning, where understanding the interaction between the grapple, logs, terrain, and obstacles requires visual information. To address these challenges, we separate image segmentation from crane control and utilise a virtual camera to provide an image stream from reconstructed 3D data. We use Cartesian control to simplify domain transfer to real-world applications. Since log piles are static, visual servoing using a 3D reconstruction of the pile and its surroundings is equivalent to using real camera data until the point of grasping. This relaxes the limits on computational resources and time for the challenge of image segmentation and allows for collecting data in situations where the log piles are not occluded. The disadvantage is the lack of information during grasping. We demonstrate that this problem is manageable and present an agent that is 95% successful in picking one or several logs from challenging piles of 2--5 logs.

agent, camera data, grapple, (17 more...)

arXiv.org Artificial Intelligence

2309.02997

Country: Europe > Sweden > Västerbotten County > Umeå (0.04)

Genre: Research Report (0.64)

Technology:

Information Technology > Artificial Intelligence > Robots (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)

Add feedback

Semi-Synthetic Dataset Augmentation for Application-Specific Gaze Estimation

Leblond-Menard, Cedric, Picard-Krashevski, Gabriel, Achiche, Sofiane

arXiv.org Artificial IntelligenceOct-27-2023

Although the number of gaze estimation datasets is growing, the application of appearance-based gaze estimation methods is mostly limited to estimating the point of gaze on a screen. This is in part because most datasets are generated in a similar fashion, where the gaze target is on a screen close to camera's origin. In other applications such as assistive robotics or marketing research, the 3D point of gaze might not be close to the camera's origin, meaning models trained on current datasets do not generalize well to these tasks. We therefore suggest generating a textured tridimensional mesh of the face and rendering the training images from a virtual camera at a specific position and orientation related to the application as a mean of augmenting the existing datasets. In our tests, this lead to an average 47% decrease in gaze estimation angular error.

application, dataset, gaze estimation, (15 more...)

arXiv.org Artificial Intelligence

2310.18469

Country: North America > United States > New York > New York County > New York City (0.04)

Genre: Research Report (0.40)

Technology:

Information Technology > Sensing and Signal Processing > Image Processing (1.00)
Information Technology > Artificial Intelligence > Machine Learning (0.88)
Information Technology > Artificial Intelligence > Vision > Image Understanding (0.34)
Information Technology > Artificial Intelligence > Vision > Face Recognition (0.31)

Add feedback

Avatarm: an Avatar With Manipulation Capabilities for the Physical Metaverse

Villani, Alberto, Cortigiani, Giovanni, Brogi, Bernardo, D'Aurizio, Nicole, Baldi, Tommaso Lisini, Prattichizzo, Domenico

arXiv.org Artificial IntelligenceMar-27-2023

Metaverse is an immersive shared space that remote users can access through virtual and augmented reality interfaces, enabling their avatars to interact with each other and the surrounding. Although digital objects can be manipulated, physical objects cannot be touched, grasped, or moved within the metaverse due to the lack of a suitable interface. This work proposes a solution to overcome this limitation by introducing the concept of a Physical Metaverse enabled by a new interface named "Avatarm". The Avatarm consists in an avatar enhanced with a robotic arm that performs physical manipulation tasks while remaining entirely hidden in the metaverse. The users have the illusion that the avatar is directly manipulating objects without the mediation by a robot. The Avatarm is the first step towards a new metaverse, the "Physical Metaverse", where users can physically interact each other and with the environment.

artificial intelligence, human computer interaction, physical metaverse, (14 more...)

arXiv.org Artificial Intelligence

2303.15187

Country:

Europe > Italy (0.04)
Asia > Middle East > UAE > Dubai Emirate > Dubai (0.04)

Genre: Research Report (0.82)

Technology:

Information Technology > Artificial Intelligence > Robots (1.00)
Information Technology > Human Computer Interaction > Interfaces > Virtual Reality (0.48)

Add feedback

Tesla patents virtualization and machine learning software to improve FSD

#artificialintelligenceFeb-27-2023, 08:53:31 GMT

Tesla has applied for a set of patents that are set to significantly improve virtualization, recognition, and Full Self Driving overall. Tesla has worked tirelessly to improve full self-driving technology in the first two months of the year. Most recently, Tesla pushed its most significant improvement to employees, v11.3. Still, with new patented technology, the software is set to continue to improve dramatically this year. The two patents, focusing on virtualization and machine learning, appeared in the U.S. Patent Office database late last week.

patent, tesla, vehicle, (9 more...)

#artificialintelligence

Country: North America > United States (0.26)

Genre: Personal > Interview (0.32)

Industry:

Law > Intellectual Property & Technology Law (0.78)
Education > Educational Technology > Educational Software > Computer Based Training (0.40)

Technology:

Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Artificial Intelligence > Robots > Autonomous Vehicles (0.67)

Add feedback

Tesla's Self Driving Algorithm Explained

#artificialintelligenceMay-27-2022, 17:45:48 GMT

Originally published on Towards AI the World's Leading AI and Technology News and Media Company. If you are building an AI-related product or service, we invite you to consider becoming an AI sponsor. At Towards AI, we help scale AI and technology startups. Let us help you unleash your technology to the masses. On Tesla AI Day, Andrej Karpathy -- the director of AI and Autopilot Vision at Tesla -- enlightened us with a presentation about their self-driving neural network.

neural network, queue, tesla, (14 more...)

#artificialintelligence

Industry: Transportation (0.31)

Technology:

Information Technology > Artificial Intelligence > Robots > Autonomous Vehicles (0.72)
Information Technology > Artificial Intelligence > Machine Learning (0.56)

Add feedback